IMPROVING THE NORMALIZED IMPORTANCE SAMPLING ESTIMATOR
نویسندگان
چکیده
منابع مشابه
Policy Improvement for POMDPs Using Normalized Importance Sampling
We present a new method for estimating the expected return of a POMDP from experi ence. The estimator does not assume any knowledge of the POMDP, can estimate the returns for finite state controllers, allows ex perience to be gathered from arbitrary se quences of policies, and estimates the return for any new policy. We motivate the estima tor from function-approximation and impor tance sa...
متن کاملImproving MCMC, using efficient importance sampling
This paper develops a systematic Markov Chain Monte Carlo (MCMC) framework based upon E cient Importance Sampling (EIS) which can be used for the analysis of a wide range of econometric models involving integrals without an analytical solution. EIS is a simple, generic and yet accurate Monte-Carlo integration procedure based on sampling densities which are chosen to be global approximations to ...
متن کاملImproving MCMC Using E cient Importance Sampling
This paper develops a systematic Markov Chain Monte Carlo (MCMC) framework based upon E cient Importance Sampling (EIS) which can be used for the analysis of a wide range of econometric models involving integrals without an analytical solution. EIS is a simple, generic and yet accurate Monte-Carlo integration procedure based on sampling densities which are chosen to be global approximations to ...
متن کاملThe Self-Normalized Estimator for Counterfactual Learning
This paper identifies a severe problem of the counterfactual risk estimator typically used in batch learning from logged bandit feedback (BLBF), and proposes the use of an alternative estimator that avoids this problem. In the BLBF setting, the learner does not receive full-information feedback like in supervised learning, but observes feedback only for the actions taken by a historical policy....
متن کاملThe Convergence Rate and Asymptotic Distribution of the Bootstrap Quantile Variance Estimator for Importance Sampling
Importance sampling is a widely used variance reduction technique to compute sample quantiles such as value at risk. The variance of the weighted sample quantile estimator is usually a difficult quantity to compute. In this paper we present the exact convergence rate and asymptotic distributions of the bootstrap variance estimators for quantiles ofweighted empirical distributions. Under regular...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Probability in the Engineering and Informational Sciences
سال: 2012
ISSN: 0269-9648,1469-8951
DOI: 10.1017/s0269964812000198